Abstract

In practice, information on the population proportion possessing a certain
attribute is often readily available; see Jhajj et al. (2006). For estimating the population
mean $\bar{Y}$ of the study variable y, following Bahl and Tuteja (1991), a ratio-product type
exponential estimator is proposed that uses the known population proportion possessing an
attribute (highly correlated with y) in simple random sampling. Expressions for the bias
and the mean-squared error (MSE) of the estimator, and for its minimum MSE, are obtained.
The proposed estimator improves on the mean per unit estimator, the ratio and product type
exponential estimators, and the Naik and Gupta (1996) estimators. The results are also
extended to two-phase sampling and are illustrated numerically using empirical populations
considered in the literature.

1. Introduction

In survey sampling, the use of auxiliary information can increase the precision of
an estimator when the study variable y is highly correlated with the auxiliary variable x.
In several practical situations, however, no auxiliary variable is available; instead there
exist auxiliary attributes that are highly correlated with the study variable y, such as
(i) the amount of milk produced and a particular breed of cow, and (ii) the yield of a wheat
crop and a particular variety of wheat (see Shabbir and Gupta (2006)).

In such situations, taking advantage of the point biserial correlation between the
study variable and the auxiliary attribute, estimators of the parameters of interest can be
constructed by using prior knowledge of the parameters of the auxiliary attribute.

Consider a sample of size n drawn by simple random sampling without
replacement (SRSWOR) from a population of size N. Let $y_i$ and $\phi_i$ denote the
observations on the variable y and the attribute $\phi$ respectively for the $i$-th unit
(i = 1, 2, ..., N). We note that $\phi_i = 1$ if the $i$-th unit of the population possesses
attribute $\phi$ and $\phi_i = 0$ otherwise. Let $A = \sum_{i=1}^{N} \phi_i$ and
$a = \sum_{i=1}^{n} \phi_i$ denote the total number of units in the population and sample
respectively possessing attribute $\phi$. Let $P = A/N$ and $p = a/n$ denote the proportion
of units in the population and sample respectively possessing attribute $\phi$.


To estimate the population mean $\bar{Y}$ of the study variable y, assuming that the
population proportion P is known, Naik and Gupta (1996) defined ratio and product
estimators of the population mean that use prior information on the population
proportion of units possessing the attribute. They proposed the following estimators:


\[ t_1 = \bar{y}\left(\frac{P}{p}\right) \tag{1.1} \]

\[ t_2 = \bar{y}\left(\frac{p}{P}\right) \tag{1.2} \]


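The MSEs of $t_1$ and $t_2$ up to the first order of approximation are

\[ \mathrm{MSE}(t_1) = f_1 \bar{Y}^2 \left[ C_y^2 + C_p^2 (1 - 2K_p) \right] \tag{1.3} \]

\[ \mathrm{MSE}(t_2) = f_1 \bar{Y}^2 \left[ C_y^2 + C_p^2 (1 + 2K_p) \right] \tag{1.4} \]

where

\[ C_y^2 = \frac{S_y^2}{\bar{Y}^2}, \quad C_p^2 = \frac{S_\phi^2}{P^2}, \quad f_1 = \frac{1}{n} - \frac{1}{N}, \quad K_p = \rho_{pb}\frac{C_y}{C_p}, \]

\[ S_y^2 = \frac{1}{N-1}\sum_{i=1}^{N}(y_i - \bar{Y})^2, \quad S_\phi^2 = \frac{1}{N-1}\sum_{i=1}^{N}(\phi_i - P)^2, \quad S_{y\phi} = \frac{1}{N-1}\sum_{i=1}^{N}(y_i - \bar{Y})(\phi_i - P), \]

and $\rho_{pb} = S_{y\phi}/(S_y S_\phi)$ is the point biserial correlation coefficient.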


Following Bahl and Tuteja (1991), we propose the following ratio and product
exponential estimators:

\[ t_3 = \bar{y}\exp\left(\frac{P - p}{P + p}\right) \tag{1.5} \]

\[ t_4 = \bar{y}\exp\left(\frac{p - P}{p + P}\right) \tag{1.6} \]
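As an illustration of how the four estimators defined so far would be computed from a single sample, here is a minimal Python sketch. The function name `attribute_estimators`, the toy data and the value P = 0.25 are assumptions made for this example only; they do not come from the paper.

```python
import math


def attribute_estimators(y_sample, phi_sample, P):
    """Return (t1, t2, t3, t4) computed from one sample.

    y_sample   : observed values of the study variable y
    phi_sample : 0/1 indicators of the attribute for the same units
    P          : known population proportion possessing the attribute
    """
    n = len(y_sample)
    y_bar = sum(y_sample) / n      # sample mean of y
    p = sum(phi_sample) / n        # sample proportion with the attribute
    if p == 0:
        raise ValueError("p = 0: the ratio estimator t1 is undefined")
    t1 = y_bar * P / p                          # ratio estimator (1.1)
    t2 = y_bar * p / P                          # product estimator (1.2)
    t3 = y_bar * math.exp((P - p) / (P + p))    # ratio-type exponential (1.5)
    t4 = y_bar * math.exp((p - P) / (p + P))    # product-type exponential (1.6)
    return t1, t2, t3, t4


# Hypothetical sample of n = 4 units; P = 0.25 is an assumed known value.
y = [4.0, 6.0, 5.0, 9.0]
phi = [0, 1, 0, 1]
t1, t2, t3, t4 = attribute_estimators(y, phi, P=0.25)
print(t1, t2, t3, t4)
```

When p equals P, all four estimators collapse to the sample mean; the exponential forms move away from it more slowly than the plain ratio and product forms as p departs from P.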


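2. Bias and MSE of $t_3$ and $t_4$

To obtain the bias and MSE of $t_3$ to the first degree of approximation, we define

\[ e_y = \frac{\bar{y} - \bar{Y}}{\bar{Y}}, \quad e_\phi = \frac{p - P}{P}, \]

so that $E(e_i) = 0$, $i = (y, \phi)$, and

\[ E(e_y^2) = f_1 C_y^2, \quad E(e_\phi^2) = f_1 C_p^2, \quad E(e_y e_\phi) = f_1 \rho_{pb} C_y C_p. \]

Expressing (1.5) in terms of e's, we have

\[ t_3 = \bar{Y}(1 + e_y)\exp\left[\frac{P - P(1 + e_\phi)}{P + P(1 + e_\phi)}\right] \approx \bar{Y}(1 + e_y)\exp\left(-\frac{e_\phi}{2}\right) \tag{2.1} \]

where the exponent $-e_\phi/(2 + e_\phi)$ has been replaced by its leading term $-e_\phi/2$.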


Expanding the right hand side of (2.1) and retaining terms up to second powers of e's, we
have

\[ t_3 = \bar{Y}\left[1 + e_y - \frac{e_\phi}{2} + \frac{e_\phi^2}{8} - \frac{e_y e_\phi}{2}\right] \tag{2.2} \]

Taking expectations of both sides of (2.2) and then subtracting $\bar{Y}$ from both sides, we get
the bias of the estimator $t_3$ up to the first order of approximation as

\[ B(t_3) = f_1 \bar{Y}\,\frac{C_p^2}{2}\left(\frac{1}{4} - K_p\right) \tag{2.3} \]

From (2.2), we have

\[ (t_3 - \bar{Y}) \cong \bar{Y}\left[e_y - \frac{e_\phi}{2}\right] \tag{2.4} \]

Squaring both sides of (2.4) and then taking expectations, we get the MSE of the estimator $t_3$
up to the first order of approximation as

\[ \mathrm{MSE}(t_3) = f_1 \bar{Y}^2\left[C_y^2 + C_p^2\left(\frac{1}{4} - K_p\right)\right] \tag{2.5} \]
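The step from the first-order error of $t_3$ to its MSE can be written out explicitly. The block below is a worked sketch that uses only the moments $E(e_y^2) = f_1 C_y^2$, $E(e_\phi^2) = f_1 C_p^2$, $E(e_y e_\phi) = f_1 \rho_{pb} C_y C_p$ and the definition $K_p = \rho_{pb} C_y / C_p$, which implies $\rho_{pb} C_y C_p = K_p C_p^2$.

```latex
\begin{align*}
E\left[(t_3 - \bar{Y})^2\right]
  &\cong \bar{Y}^2\, E\left[\left(e_y - \tfrac{1}{2} e_\phi\right)^2\right] \\
  &= \bar{Y}^2 \left[ E(e_y^2) + \tfrac{1}{4} E(e_\phi^2) - E(e_y e_\phi) \right] \\
  &= f_1 \bar{Y}^2 \left[ C_y^2 + \tfrac{1}{4} C_p^2 - \rho_{pb} C_y C_p \right] \\
  &= f_1 \bar{Y}^2 \left[ C_y^2 + C_p^2 \left( \tfrac{1}{4} - K_p \right) \right].
\end{align*}
```

The same steps with $+\tfrac{1}{2} e_\phi$ in place of $-\tfrac{1}{2} e_\phi$ give the corresponding expression for the product-type exponential estimator, with $\tfrac{1}{4} + K_p$ in the bracket.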
To obtain the bias and MSE of $t_4$ to the first degree of approximation, we express (1.6) in
terms of e's:

\[ t_4 = \bar{Y}(1 + e_y)\exp\left[\frac{P(1 + e_\phi) - P}{P(1 + e_\phi) + P}\right] \tag{2.6} \]

and, following the above procedure, we get the bias and MSE of $t_4$ as follows:

\[ B(t_4) = f_1 \bar{Y}\,\frac{C_p^2}{2}\left(\frac{1}{4} + K_p\right) \tag{2.7} \]

\[ \mathrm{MSE}(t_4) = f_1 \bar{Y}^2\left[C_y^2 + C_p^2\left(\frac{1}{4} + K_p\right)\right] \tag{2.8} \]


3. Proposed class of estimators 

It has been theoretically established that, in general, the linear regression
estimator is more efficient than the ratio (product) estimator except when the regression
line of y on x passes through the neighborhood of the origin, in which case the
efficiencies of these estimators are almost equal. In many practical situations, however,
the regression line does not pass through the neighborhood of the origin, and the ratio
estimator then does not perform as well as the linear regression estimator.

Following Singh and Espejo (2003), we propose the following class of ratio-product
type exponential estimators:

\[ t_5 = \bar{y}\left[\alpha \exp\left(\frac{P - p}{P + p}\right) + (1 - \alpha)\exp\left(\frac{p - P}{p + P}\right)\right] \tag{3.1} \]

where $\alpha$ is a real constant to be determined such that the MSE of $t_5$ is minimum.

For $\alpha = 1$, $t_5$ reduces to the estimator $t_3 = \bar{y}\exp\left(\frac{P - p}{P + p}\right)$, and for $\alpha = 0$ it reduces to
$t_4 = \bar{y}\exp\left(\frac{p - P}{p + P}\right)$.

Bias and MSE of $t_5$:


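Expressing (3.1) in terms of e's, we have

\[ t_5 = \bar{Y}(1 + e_y)\left[\alpha \exp\left\{\frac{P - P(1 + e_\phi)}{P + P(1 + e_\phi)}\right\} + (1 - \alpha)\exp\left\{\frac{P(1 + e_\phi) - P}{P(1 + e_\phi) + P}\right\}\right] \]

\[ \approx \bar{Y}(1 + e_y)\left[\alpha \exp\left(-\frac{e_\phi}{2}\right) + (1 - \alpha)\exp\left(\frac{e_\phi}{2}\right)\right] \tag{3.2} \]

Expanding the right hand side of (3.2) and retaining terms up to second powers of e's, we
have

\[ t_5 = \bar{Y}\left[1 + e_y + \frac{e_\phi}{2} + \frac{e_\phi^2}{8} - \alpha e_\phi + \frac{e_y e_\phi}{2} - \alpha e_y e_\phi\right] \tag{3.3} \]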


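Taking expectations of both sides of (3.3) and then subtracting $\bar{Y}$ from both sides, we get
the bias of the estimator $t_5$ up to the first order of approximation as

\[ B(t_5) = f_1 \bar{Y}\left[\frac{C_p^2}{8} + \rho_{pb} C_y C_p\left(\frac{1}{2} - \alpha\right)\right] \tag{3.4} \]

From (3.3), we have

\[ (t_5 - \bar{Y}) \cong \bar{Y}\left[e_y + e_\phi\left(\frac{1}{2} - \alpha\right)\right] \tag{3.5} \]

Squaring both sides of (3.5) and then taking expectations, we get the MSE of the estimator $t_5$
up to the first order of approximation as

\[ \mathrm{MSE}(t_5) = f_1 \bar{Y}^2\left[C_y^2 + C_p^2\left(\frac{1}{4} + \alpha^2 - \alpha\right) + 2\rho_{pb} C_y C_p\left(\frac{1}{2} - \alpha\right)\right] \tag{3.6} \]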
Minimization of (3.6) with respect to $\alpha$ yields its optimum value as

\[ \alpha = \frac{2K_p + 1}{2} = \alpha_0 \ \text{(say)} \tag{3.7} \]

Substitution of (3.7) in (3.1) yields the optimum estimator for $t_5$ as $(t_5)_{opt}$ (say) with
minimum MSE

\[ \min \mathrm{MSE}(t_5) = f_1 \bar{Y}^2 C_y^2\left(1 - \rho_{pb}^2\right) = M(t_5)_{opt} \tag{3.8} \]

which is the same as that of the traditional linear regression estimator.
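The optimum value of the constant can be checked numerically. The sketch below evaluates the first-order MSE of the proposed class, f1 * Ybar^2 * [C_y^2 + C_p^2 (1/4 + a^2 - a) + 2 rho C_y C_p (1/2 - a)], on a grid of values of the constant a and compares the grid minimizer with (2 K_p + 1)/2. The parameter values are those reported for Population 1 in Section 5; the sample size n = 20 is an assumption made only for this illustration, since the paper does not state it, and the function name `mse_t5` is likewise hypothetical.

```python
def mse_t5(alpha, f1, Y_bar, C_y, C_p, rho):
    """First-order MSE of the ratio-product exponential class at a given alpha."""
    return f1 * Y_bar**2 * (
        C_y**2
        + C_p**2 * (0.25 + alpha**2 - alpha)
        + 2 * rho * C_y * C_p * (0.5 - alpha)
    )


# Population 1 parameters (Section 5); n = 20 is an assumed sample size.
N, n = 89, 20
Y_bar, C_y, C_p, rho = 3.360, 0.60400, 2.19012, 0.766
f1 = 1 / n - 1 / N

K_p = rho * C_y / C_p
alpha_0 = (2 * K_p + 1) / 2            # closed-form optimum

# Brute-force search over alpha in [-1, 2] with step 0.001.
grid = [i / 1000 for i in range(-1000, 2001)]
alpha_grid = min(grid, key=lambda a: mse_t5(a, f1, Y_bar, C_y, C_p, rho))

min_mse = mse_t5(alpha_0, f1, Y_bar, C_y, C_p, rho)
regression_mse = f1 * Y_bar**2 * C_y**2 * (1 - rho**2)

print(alpha_0, alpha_grid)
print(min_mse, regression_mse)
```

The grid minimizer agrees with the closed-form optimum up to the grid spacing, and the MSE at the optimum coincides with the regression-estimator value f1 * Ybar^2 * C_y^2 (1 - rho^2). The choice of n only rescales both quantities through f1.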
4. Efficiency comparisons 


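In this section, the conditions under which the proposed estimator $t_5$ is better than $\bar{y}$,
$t_1$, $t_2$, $t_3$ and $t_4$ are obtained. The variance of $\bar{y}$ is given by

\[ \mathrm{var}(\bar{y}) = f_1 \bar{Y}^2 C_y^2 \tag{4.1} \]

To compare the efficiency of the proposed estimator $t_5$ with the existing estimators, from
(4.1), (1.3), (1.4), (2.5), (2.8) and (3.8) we have

\[ \mathrm{var}(\bar{y}) - M(t_5)_{opt} = f_1 \bar{Y}^2 C_y^2 \rho_{pb}^2 \ge 0 \tag{4.2} \]

\[ \mathrm{MSE}(t_1) - M(t_5)_{opt} = f_1 \bar{Y}^2\left(C_p - \rho_{pb} C_y\right)^2 \ge 0 \tag{4.3} \]

\[ \mathrm{MSE}(t_2) - M(t_5)_{opt} = f_1 \bar{Y}^2\left(C_p + \rho_{pb} C_y\right)^2 \ge 0 \tag{4.4} \]

\[ \mathrm{MSE}(t_3) - M(t_5)_{opt} = f_1 \bar{Y}^2\left(\frac{C_p}{2} - \rho_{pb} C_y\right)^2 \ge 0 \tag{4.5} \]

\[ \mathrm{MSE}(t_4) - M(t_5)_{opt} = f_1 \bar{Y}^2\left(\frac{C_p}{2} + \rho_{pb} C_y\right)^2 \ge 0 \tag{4.6} \]

Using (4.2)-(4.6), we conclude that the proposed estimator $t_5$, at its optimum, is never less
efficient than $\bar{y}$, $t_1$, $t_2$, $t_3$ and $t_4$.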
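Each of these differences follows by completing a square. As a worked sketch for the comparison with the ratio-type exponential estimator $t_3$, using $K_p C_p^2 = \rho_{pb} C_y C_p$:

```latex
\begin{align*}
\mathrm{MSE}(t_3) - f_1 \bar{Y}^2 C_y^2 \left(1 - \rho_{pb}^2\right)
  &= f_1 \bar{Y}^2 \left[ C_p^2 \left( \tfrac{1}{4} - K_p \right) + \rho_{pb}^2 C_y^2 \right] \\
  &= f_1 \bar{Y}^2 \left[ \tfrac{1}{4} C_p^2 - \rho_{pb} C_y C_p + \rho_{pb}^2 C_y^2 \right] \\
  &= f_1 \bar{Y}^2 \left( \tfrac{1}{2} C_p - \rho_{pb} C_y \right)^2 \;\ge\; 0 .
\end{align*}
```

Equality holds only when $C_p = 2\rho_{pb} C_y$, that is, when $K_p = 1/2$; the other comparisons have analogous equality conditions.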

5. Empirical study 

We now compare the performance of the various estimators considered here using the
following data sets:

Population 1. [Source: Sukhatme and Sukhatme (1970), p. 256]

y = number of villages in the circles and

$\phi$ = a circle consisting of more than five villages.

N = 89, $\bar{Y}$ = 3.360, P = 0.1236, $\rho_{pb}$ = 0.766, $C_y$ = 0.60400, $C_p$ = 2.19012.

Population 2. [Source: Mukhopadhyaya (2000), p. 44]

y = household size and

$\phi$ = a household that availed an agricultural loan from a bank.

N = 25, $\bar{Y}$ = 9.44, P = 0.400, $\rho_{pb}$ = -0.387, $C_y$ = 0.17028, $C_p$ = 1.27478.


The percent relative efficiencies (PREs) of the estimators $\bar{y}$, $t_1$-$t_4$ and $(t_5)_{opt}$ with
respect to the usual unbiased estimator $\bar{y}$ have been computed and compiled in Table 5.1.

Table 5.1: PRE of various estimators with respect to $\bar{y}$.

Estimator          PRE (., $\bar{y}$)
                   Population I    Population II

$\bar{y}$              100             100
$t_1$                   11.63            1.59
$t_2$                    5.07            1.94
$t_3$                   66.24            5.97
$t_4$                   14.15            8.24
$(t_5)_{opt}$          241.98          117.61

Table 5.1 shows that the proposed estimator $t_5$ under the optimum condition performs
better than the usual sample mean $\bar{y}$, the Naik and Gupta (1996) estimators ($t_1$ and $t_2$) and
the ratio and product type exponential estimators ($t_3$ and $t_4$).
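The PRE values follow directly from the first-order MSE expressions, because the common factor f1 * Ybar^2 cancels in every ratio var(ybar)/MSE. The following Python sketch recomputes the Population I column from the reported C_y, C_p and rho_pb; the function name `pre_single_phase` and the dictionary keys are illustrative choices, not notation from the paper.

```python
def pre_single_phase(C_y, C_p, rho):
    """Percent relative efficiencies with respect to the sample mean.

    Every MSE is divided by the common factor f1 * Ybar^2, which cancels
    in the ratio 100 * var(ybar) / MSE(estimator).
    """
    K_p = rho * C_y / C_p
    var_ybar = C_y**2
    scaled_mse = {
        "t1": C_y**2 + C_p**2 * (1 - 2 * K_p),      # ratio estimator
        "t2": C_y**2 + C_p**2 * (1 + 2 * K_p),      # product estimator
        "t3": C_y**2 + C_p**2 * (0.25 - K_p),       # ratio-type exponential
        "t4": C_y**2 + C_p**2 * (0.25 + K_p),       # product-type exponential
        "t5_opt": C_y**2 * (1 - rho**2),            # proposed class at optimum
    }
    return {name: 100 * var_ybar / m for name, m in scaled_mse.items()}


# Population 1: C_y = 0.60400, C_p = 2.19012, rho_pb = 0.766
pre_pop1 = pre_single_phase(0.60400, 2.19012, 0.766)
for name, value in pre_pop1.items():
    print(f"{name:7s} {value:8.2f}")
```

For Population 1 the computed values agree with the tabulated 11.63, 5.07, 66.24, 14.15 and 241.98 to within rounding in the second decimal place.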


6. Double sampling 
In some practical situations P is not known a priori, and the technique of two-phase
sampling is used. Let $p'$ denote the proportion of units possessing attribute $\phi$ in the first
phase sample of size $n'$, $p$ the proportion of units possessing attribute $\phi$ in the
second phase sample of size $n < n'$, and $\bar{y}$ the mean of the study variable y in the
second phase sample.

When P is not known, the two-phase ratio and product type exponential estimators are
given by

\[ t_6 = \bar{y}\exp\left(\frac{p' - p}{p' + p}\right) \tag{6.1} \]

\[ t_7 = \bar{y}\exp\left(\frac{p - p'}{p + p'}\right) \tag{6.2} \]


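To obtain the bias and MSE of $t_6$ and $t_7$, we write

\[ \bar{y} = \bar{Y}(1 + e_y), \quad p = P(1 + e_\phi), \quad p' = P(1 + e'_\phi) \]

such that

\[ E(e_y) = E(e_\phi) = E(e'_\phi) = 0 \]

and

\[ E(e_y^2) = f_1 C_y^2, \quad E(e_\phi^2) = f_1 C_p^2, \quad E(e'^2_\phi) = f_2 C_p^2, \quad E(e_y e'_\phi) = f_2 \rho_{pb} C_y C_p, \]

where $f_2 = \frac{1}{n'} - \frac{1}{N}$. The derivations below also use $E(e_y e_\phi) = f_1 \rho_{pb} C_y C_p$,
$E(e_\phi e'_\phi) = f_2 C_p^2$ and $f_3 = f_1 - f_2 = \frac{1}{n} - \frac{1}{n'}$.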


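Expressing (6.1) in terms of e's, we have

\[ t_6 = \bar{Y}(1 + e_y)\exp\left[\frac{P(1 + e'_\phi) - P(1 + e_\phi)}{P(1 + e'_\phi) + P(1 + e_\phi)}\right] \approx \bar{Y}(1 + e_y)\exp\left[\frac{e'_\phi - e_\phi}{2}\right] \tag{6.3} \]

Expanding the right hand side of (6.3) and retaining terms up to second powers of e's, we
have

\[ t_6 = \bar{Y}\left[1 + e_y - \frac{e_\phi}{2} + \frac{e'_\phi}{2} + \frac{e_\phi^2}{8} + \frac{e'^2_\phi}{8} - \frac{e_\phi e'_\phi}{4} - \frac{e_y e_\phi}{2} + \frac{e_y e'_\phi}{2}\right] \tag{6.4} \]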
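Taking expectations of both sides of (6.4) and then subtracting $\bar{Y}$ from both sides, we get
the bias of the estimator $t_6$ up to the first order of approximation as

\[ B(t_6) = f_3 \bar{Y}\,\frac{C_p^2}{4}\left(\frac{1}{2} - 2K_p\right) \tag{6.5} \]

From (6.4), we have

\[ (t_6 - \bar{Y}) \cong \bar{Y}\left[e_y - \frac{e_\phi}{2} + \frac{e'_\phi}{2}\right] \tag{6.6} \]

Squaring both sides of (6.6) and then taking expectations, we get the MSE of the estimator $t_6$
up to the first order of approximation as

\[ \mathrm{MSE}(t_6) = \bar{Y}^2\left[f_1 C_y^2 + f_3\frac{C_p^2}{4}\left(1 - 4K_p\right)\right] \tag{6.7} \]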


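To obtain the bias and MSE of $t_7$ to the first degree of approximation, we express (6.2) in
terms of e's as

\[ t_7 = \bar{Y}(1 + e_y)\exp\left[\frac{P(1 + e_\phi) - P(1 + e'_\phi)}{P(1 + e_\phi) + P(1 + e'_\phi)}\right] \approx \bar{Y}(1 + e_y)\exp\left[\frac{e_\phi - e'_\phi}{2}\right] \tag{6.8} \]

Expanding the right hand side of (6.8) and retaining terms up to second powers of e's, we
have

\[ t_7 = \bar{Y}\left[1 + e_y + \frac{e_\phi}{2} - \frac{e'_\phi}{2} + \frac{e_\phi^2}{8} + \frac{e'^2_\phi}{8} - \frac{e_\phi e'_\phi}{4} + \frac{e_y e_\phi}{2} - \frac{e_y e'_\phi}{2}\right] \tag{6.9} \]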


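Taking expectations of both sides of (6.9) and then subtracting $\bar{Y}$ from both sides, we get
the bias of the estimator $t_7$ up to the first order of approximation as

\[ B(t_7) = f_3 \bar{Y}\,\frac{C_p^2}{4}\left(\frac{1}{2} + 2K_p\right) \tag{6.10} \]

From (6.9), we have

\[ (t_7 - \bar{Y}) \cong \bar{Y}\left[e_y + \frac{e_\phi}{2} - \frac{e'_\phi}{2}\right] \tag{6.11} \]

Squaring both sides of (6.11) and then taking expectations, we get the MSE of the estimator
$t_7$ up to the first order of approximation as

\[ \mathrm{MSE}(t_7) = \bar{Y}^2\left[f_1 C_y^2 + f_3\frac{C_p^2}{4}\left(1 + 4K_p\right)\right] \tag{6.12} \]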


7. Proposed class of estimators in double sampling 


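We propose the following class of estimators in double sampling:

\[ t_8 = \bar{y}\left[\alpha_1 \exp\left(\frac{p' - p}{p' + p}\right) + (1 - \alpha_1)\exp\left(\frac{p - p'}{p + p'}\right)\right] \tag{7.1} \]

where $\alpha_1$ is a real constant to be determined such that the MSE of $t_8$ is minimum.

For $\alpha_1 = 1$, $t_8$ reduces to the estimator $t_6 = \bar{y}\exp\left(\frac{p' - p}{p' + p}\right)$, and for $\alpha_1 = 0$ it reduces to
$t_7 = \bar{y}\exp\left(\frac{p - p'}{p + p'}\right)$.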


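Bias and MSE of $t_8$:

Expressing (7.1) in terms of e's, we have

\[ t_8 = \bar{Y}(1 + e_y)\left[\alpha_1 \exp\left\{\frac{P(1 + e'_\phi) - P(1 + e_\phi)}{P(1 + e'_\phi) + P(1 + e_\phi)}\right\} + (1 - \alpha_1)\exp\left\{\frac{P(1 + e_\phi) - P(1 + e'_\phi)}{P(1 + e_\phi) + P(1 + e'_\phi)}\right\}\right] \]

\[ \approx \bar{Y}(1 + e_y)\left[\alpha_1 \exp\left\{\frac{e'_\phi - e_\phi}{2}\right\} + (1 - \alpha_1)\exp\left\{\frac{e_\phi - e'_\phi}{2}\right\}\right] \tag{7.2} \]

Expanding the right hand side of (7.2) and retaining terms up to second powers of e's, we
have

\[ t_8 = \bar{Y}\Big[1 + e_y + \frac{e_\phi}{2} - \frac{e'_\phi}{2} + \frac{e_\phi^2}{8} + \frac{e'^2_\phi}{8} - \frac{e_\phi e'_\phi}{4} + \frac{e_y e_\phi}{2} - \frac{e_y e'_\phi}{2} \]

\[ \qquad\quad - \alpha_1 e_\phi + \alpha_1 e'_\phi + \alpha_1 e_y e'_\phi - \alpha_1 e_y e_\phi\Big] \tag{7.3} \]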


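Taking expectations of both sides of (7.3) and then subtracting $\bar{Y}$ from both sides, we get
the bias of the estimator $t_8$ up to the first order of approximation as

\[ B(t_8) = f_3 \bar{Y}\left[\frac{C_p^2}{8} + \rho_{pb} C_y C_p\left(\frac{1}{2} - \alpha_1\right)\right] \tag{7.4} \]

From (7.3), we have

\[ (t_8 - \bar{Y}) \cong \bar{Y}\left[e_y - \left(\alpha_1 - \frac{1}{2}\right)e_\phi + \left(\alpha_1 - \frac{1}{2}\right)e'_\phi\right] \tag{7.5} \]

Squaring both sides of (7.5) and then taking expectations, we get the MSE of the estimator $t_8$
up to the first order of approximation as

\[ \mathrm{MSE}(t_8) = \bar{Y}^2\left[f_1 C_y^2 + f_3 C_p^2\left(\frac{1}{4} + \alpha_1^2 - \alpha_1\right) + 2 f_3 \rho_{pb} C_y C_p\left(\frac{1}{2} - \alpha_1\right)\right] \tag{7.6} \]

Minimization of (7.6) with respect to $\alpha_1$ yields its optimum value as

\[ \alpha_1 = \frac{2K_p + 1}{2} = \alpha_{10} \ \text{(say)} \tag{7.7} \]

Substitution of (7.7) in (7.1) yields the optimum estimator for $t_8$ as $(t_8)_{opt}$ (say) with
minimum MSE

\[ \min \mathrm{MSE}(t_8) = \bar{Y}^2 C_y^2\left(f_1 - f_3 \rho_{pb}^2\right) = M(t_8)_{opt} \ \text{(say)} \tag{7.8} \]

which is the same as that of the traditional linear regression estimator.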
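The minimum can be verified by direct substitution. The worked sketch below writes the first-order MSE of $t_8$ in terms of $\alpha_1 - \tfrac{1}{2}$, sets $\alpha_1 - \tfrac{1}{2} = K_p$ (the optimum), and uses $\rho_{pb} C_y C_p = K_p C_p^2$ and $K_p C_p = \rho_{pb} C_y$, with $f_3 = \frac{1}{n} - \frac{1}{n'}$.

```latex
\begin{align*}
\mathrm{MSE}(t_8)
  &= \bar{Y}^2 \left[ f_1 C_y^2 + f_3 C_p^2 \left(\alpha_1 - \tfrac{1}{2}\right)^2
     - 2 f_3 \rho_{pb} C_y C_p \left(\alpha_1 - \tfrac{1}{2}\right) \right] \\
  &= \bar{Y}^2 \left[ f_1 C_y^2 + f_3 C_p^2 K_p^2 - 2 f_3 C_p^2 K_p^2 \right]
     \qquad \left(\alpha_1 - \tfrac{1}{2} = K_p\right) \\
  &= \bar{Y}^2 \left[ f_1 C_y^2 - f_3 K_p^2 C_p^2 \right] \\
  &= \bar{Y}^2 C_y^2 \left( f_1 - f_3 \rho_{pb}^2 \right).
\end{align*}
```

Because the MSE is a convex quadratic in $\alpha_1$ (the coefficient of $\alpha_1^2$ is $f_3 \bar{Y}^2 C_p^2 > 0$), this stationary value is the global minimum.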


8. Efficiency comparisons 


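The MSEs of the usual two-phase ratio and product estimators, denoted $t_9$ and $t_{10}$
respectively, are given by

\[ \mathrm{MSE}(t_9) = \bar{Y}^2\left[f_1 C_y^2 + f_3 C_p^2\left(1 - 2K_p\right)\right] \tag{8.1} \]

\[ \mathrm{MSE}(t_{10}) = \bar{Y}^2\left[f_1 C_y^2 + f_3 C_p^2\left(1 + 2K_p\right)\right] \tag{8.2} \]

From (4.1), (6.7), (6.12), (8.1), (8.2) and (7.8) we have

\[ \mathrm{var}(\bar{y}) - M(t_8)_{opt} = f_3 \bar{Y}^2 C_y^2 \rho_{pb}^2 \ge 0 \tag{8.3} \]

\[ \mathrm{MSE}(t_6) - M(t_8)_{opt} = f_3 \bar{Y}^2\left(\frac{C_p}{2} - \rho_{pb} C_y\right)^2 \ge 0 \tag{8.4} \]

\[ \mathrm{MSE}(t_7) - M(t_8)_{opt} = f_3 \bar{Y}^2\left(\frac{C_p}{2} + \rho_{pb} C_y\right)^2 \ge 0 \tag{8.5} \]

\[ \mathrm{MSE}(t_9) - M(t_8)_{opt} = f_3 \bar{Y}^2\left(C_p - \rho_{pb} C_y\right)^2 \ge 0 \tag{8.6} \]

\[ \mathrm{MSE}(t_{10}) - M(t_8)_{opt} = f_3 \bar{Y}^2\left(C_p + \rho_{pb} C_y\right)^2 \ge 0 \tag{8.7} \]

From (8.3)-(8.7), we conclude that the proposed estimator $t_8$, at its optimum, is never less
efficient than $\bar{y}$, $t_6$, $t_7$, $t_9$ and $t_{10}$.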


9. Empirical study 


The results obtained in the previous section are now examined with the
help of the following data:

Population 1. [Source: Sukhatme and Sukhatme (1970), p. 256]

N = 89, $n'$ = 45, n = 23, $\bar{y}$ = 1322, p = 0.1304, $p'$ = 0.1333, $\rho_{pb}$ = 0.408, $C_y$ = 0.69144,
$C_p$ = 2.7005.

Population 2. [Source: Mukhopadhyaya (2000), p. 44]

N = 25, $n'$ = 13, n = 7, $\bar{y}$ = 7.143, p = 0.294, $p'$ = 0.308, $\rho_{pb}$ = -0.314, $C_y$ = 0.36442,
$C_p$ = 1.34701.


Table 9.1: PRE of various estimators (double sampling) with respect to $\bar{y}$.

Estimator          PRE (., $\bar{y}$)
                   Population I    Population II

$\bar{y}$              100             100
$t_6$                   40.59           25.42
$t_7$                   21.90           40.89
$t_9$                   11.16            8.89
$t_{10}$                 7.60           12.09
$(t_8)_{opt}$          112.32          106.74


Table 9.1 shows that the proposed estimator $t_8$ under the optimum condition performs better
than the usual sample mean $\bar{y}$, $t_6$, $t_7$, $t_9$ and $t_{10}$.
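The double-sampling PREs can be recomputed from the first-order MSE expressions with f1 = 1/n - 1/N and f3 = 1/n - 1/n'; the factor Ybar^2 cancels in each ratio. The sketch below does this for Population 1 of this section. The function name `pre_two_phase`, its argument names and the dictionary keys are illustrative assumptions.

```python
def pre_two_phase(N, n_first, n, C_y, C_p, rho):
    """Percent relative efficiencies in two-phase sampling w.r.t. the sample mean.

    N       : population size
    n_first : first-phase sample size n'
    n       : second-phase sample size
    All MSEs are divided by Ybar^2, which cancels in the ratios.
    """
    f1 = 1 / n - 1 / N
    f3 = 1 / n - 1 / n_first
    K_p = rho * C_y / C_p
    var_ybar = f1 * C_y**2
    scaled_mse = {
        "t6": f1 * C_y**2 + f3 * (C_p**2 / 4) * (1 - 4 * K_p),   # ratio-type exponential
        "t7": f1 * C_y**2 + f3 * (C_p**2 / 4) * (1 + 4 * K_p),   # product-type exponential
        "t9": f1 * C_y**2 + f3 * C_p**2 * (1 - 2 * K_p),         # two-phase ratio
        "t10": f1 * C_y**2 + f3 * C_p**2 * (1 + 2 * K_p),        # two-phase product
        "t8_opt": C_y**2 * (f1 - f3 * rho**2),                   # proposed class at optimum
    }
    return {name: 100 * var_ybar / m for name, m in scaled_mse.items()}


# Population 1: N = 89, n' = 45, n = 23, C_y = 0.69144, C_p = 2.7005, rho_pb = 0.408
pre2_pop1 = pre_two_phase(89, 45, 23, 0.69144, 2.7005, 0.408)
for name, value in pre2_pop1.items():
    print(f"{name:7s} {value:8.2f}")
```

For Population 1 the computed values match the tabulated 40.59, 21.90, 11.16, 7.60 and 112.32 to within rounding in the second decimal place.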